Lexical Chains on WordNet and Extensions
Abstract
Lexical chains between two concepts are sequences of semantically related words interconnected via semantic relations. This paper presents a new approach for the automatic construction of lexical chains on knowledge bases. Experiments were performed by building lexical chains on WordNet, Extended WordNet, and Extended WordNet Knowledge Base. The research addresses the problems of ranking lexical chains and labeling them with appropriate semantic names.

Introduction to Lexical Chains

Lexical chains are sequences of semantically related words interconnected via semantic relations. They establish semantic connectivity between two end concepts. Lexical chains are constructed on knowledge bases that contain concepts and relations between concepts. In this paper, lexical chains are built on three resources: WordNet (WN), eXtended WordNet (XWN), and eXtended WordNet Knowledge Base (XWN KB). Each of these resources can be viewed as a large semantic graph, so finding lexical chains amounts to finding paths between concepts.

In general, there are many possible chains between two concepts. For example, for the pair (person, teach) there are at least two useful lexical chains, each giving a different interpretation of the connectivity:

1. person:n#1 --ISA⁻¹--> enrollee:n#1 --ISA⁻¹--> student:n#1 --DERIVATION--> educate:n#1 --DERIVATION--> education:n#1 --DERIVATION--> teach:v#1, where a person is the beneficiary of teaching, and
2. person:n#1 --ISA⁻¹--> leader:n#1 --ISA⁻¹--> trainer:n#1 --DERIVATION--> train:v#1 --DERIVATION--> education:n#1 --DERIVATION--> teach:v#1, where a person is doing the teaching.

ISA⁻¹ (i.e., hyponymy) and DERIVATION are WordNet relations.

There are also meaningless chains, which the system needs to filter out because they do not provide any useful semantic information. For example, time:n#2 --> clock:v#1 --> certain:a#1 --> stand by:v#2 --> wait:v#1. The task is therefore to rank lexical chains and find the best one. In addition to finding the best chain overall, identifying all valid chains between two concepts may be of interest for some applications.

Copyright © 2013, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
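The view of chain construction as path finding in a semantic graph can be illustrated with a small sketch. It is not the paper's algorithm: the graph below is a hand-built toy containing only the edges of the two person-teach chains quoted above (sense tags copied as given), and the names `EDGES`, `build_graph`, `find_chains`, `format_chain`, and the `max_relations` cutoff are hypothetical choices made for this illustration. The sketch only enumerates candidate chains with a depth-limited search; ranking and filtering of meaningless chains are not modeled.

```python
from collections import defaultdict

# Toy semantic graph (assumption): directed edges copied from the two
# person-teach chains quoted in the text. A real run would draw edges
# from WordNet, XWN, or XWN KB instead.
EDGES = [
    ("person:n#1", "ISA^-1", "enrollee:n#1"),
    ("enrollee:n#1", "ISA^-1", "student:n#1"),
    ("student:n#1", "DERIVATION", "educate:n#1"),
    ("educate:n#1", "DERIVATION", "education:n#1"),
    ("person:n#1", "ISA^-1", "leader:n#1"),
    ("leader:n#1", "ISA^-1", "trainer:n#1"),
    ("trainer:n#1", "DERIVATION", "train:v#1"),
    ("train:v#1", "DERIVATION", "education:n#1"),
    ("education:n#1", "DERIVATION", "teach:v#1"),
]


def build_graph(edges):
    """Map each concept to its outgoing (relation, target) pairs."""
    graph = defaultdict(list)
    for source, relation, target in edges:
        graph[source].append((relation, target))
    return graph


def find_chains(graph, start, goal, max_relations=6):
    """Enumerate cycle-free chains from start to goal.

    Each chain is a list of (source, relation, target) triples and uses
    at most max_relations relations (a hypothetical cutoff).
    """
    chains = []

    def extend(node, chain, visited):
        if node == goal:
            chains.append(list(chain))
            return
        if len(chain) >= max_relations:
            return
        for relation, target in graph.get(node, []):
            if target not in visited:
                chain.append((node, relation, target))
                visited.add(target)
                extend(target, chain, visited)
                visited.remove(target)
                chain.pop()

    extend(start, [], {start})
    return chains


def format_chain(chain):
    """Render a chain in the arrow notation used in the text."""
    if not chain:
        return ""
    parts = [chain[0][0]]
    for _, relation, target in chain:
        parts.append(f"--{relation}--> {target}")
    return " ".join(parts)


GRAPH = build_graph(EDGES)
CHAINS = find_chains(GRAPH, "person:n#1", "teach:v#1")
for chain in CHAINS:
    print(format_chain(chain))
```

On this toy graph the search recovers both quoted chains, one through student and one through trainer. On a full resource the number of candidate paths grows quickly, which is why a length cutoff and a subsequent ranking step matter.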
Similar resources
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of the English language in which nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
Exploring Lexical Patterns in Text: Lexical Cohesion Analysis with WordNet
We present a system for the linguistic exploration and analysis of lexical cohesion in English texts. Using an electronic thesaurus-like resource, Princeton WordNet, and the Brown Corpus of English, we have implemented a process of annotating text with lexical chains and a graphical user interface for inspection of the annotated text. We describe the system and report on some sample linguistic ...
Semantic Feature Structure Extraction from Documents Based on Extended Lexical Chains
The meaning of a sentence in a document is more easily determined if its constituent words exhibit cohesion with respect to their individual semantics. This paper explores the degree of cohesion among a document's words using lexical chains as a semantic representation of its meaning. Using a combination of diverse types of lexical chains, we develop a text document representation that can be u...
An Automatic Text Summarization Using Lexical Cohesion and Correlation of Sentences
Due to the substantial increase in the amount of information on the Internet, it has become extremely difficult to search for the relevant documents that users need. To solve this problem, text summarization is used, which produces a summary that contains the important content of the document. This paper proposes a better approach for text summarization using lexical chai...
WordNet for Italian and Its Use for Lexical Discrimination
We present a prototype of the Italian version of WordNet, a general computational lexical resource. Some relevant extensions are discussed to make it usable for parsing: in particular, we add verbal selectional restrictions to make lexical discrimination effective. Italian WordNet has been coupled with a parser and a number of experiments have been performed to individuate the methodology with ...